Smart Paper Notes

Learning Shape Control of Elastoplastic Deformable Linear Objects

Rita Laezza , Yiannis Karayiannidis

Category: Robotics

2022-08-03

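Deformable object manipulation tasks have long been regarded as challenging robotic problems. However, until recently very little work had been done on the subject, with most robotic manipulation methods being developed for rigid objects. Deformable objects are more difficult to model and simulate, which has limited the use of model-free Reinforcement Learning (RL) strategies, because they need large amounts of data that can only be obtained in simulation. This paper proposes a new shape control task for Deformable Linear Objects (DLOs). More notably, we present the first study on the effects of elastoplastic properties on this type of problem. Objects with elastoplasticity, such as metal wires, are found in various applications and are challenging to manipulate due to their nonlinear behavior. We first highlight the challenges of solving such a manipulation task from an RL perspective, particularly in defining the reward. Then, based on concepts from differential geometry, we propose an intrinsic shape representation using discrete curvature and torsion. Finally, we show through an empirical study that, in order to successfully solve the proposed task using Deep Deterministic Policy Gradient (DDPG), the reward needs to include intrinsic information about the shape of the DLO.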
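
The intrinsic shape representation mentioned in the abstract can be illustrated with a minimal sketch. It assumes the DLO is sampled as an ordered list of 3D points and uses one common discretization (turning angle per unit length for curvature, signed binormal rotation per unit length for torsion). The function name `discrete_curvature_torsion` is hypothetical, and the paper's exact formulas may differ.

```python
import numpy as np

def discrete_curvature_torsion(points):
    """Estimate curvature and torsion along a 3D polyline.

    Returns (curvature, torsion): curvature has one value per interior
    vertex (n - 2 values), torsion one value per interior edge (n - 3 values).
    Assumes consecutive points are distinct (no zero-length edges).
    """
    p = np.asarray(points, dtype=float)
    edges = np.diff(p, axis=0)
    lengths = np.linalg.norm(edges, axis=1)
    tangents = edges / lengths[:, None]

    # Unnormalized binormal at each interior vertex (zero where the line is straight).
    binormals = np.cross(tangents[:-1], tangents[1:])

    # Curvature: turning angle between consecutive tangents,
    # divided by the mean length of the two adjacent edges.
    cos_turn = np.sum(tangents[:-1] * tangents[1:], axis=1)
    sin_turn = np.linalg.norm(binormals, axis=1)
    curvature = np.arctan2(sin_turn, cos_turn) / (0.5 * (lengths[:-1] + lengths[1:]))

    # Torsion: signed angle between consecutive binormals, measured about
    # the edge they share, divided by that edge's length.
    b0, b1 = binormals[:-1], binormals[1:]
    shared = tangents[1:-1]
    sin_part = np.sum(np.cross(b0, b1) * shared, axis=1)
    cos_part = np.sum(b0 * b1, axis=1)
    torsion = np.arctan2(sin_part, cos_part) / lengths[1:-1]
    return curvature, torsion

# Usage: a helix (cos t, sin t, t) has analytic curvature and torsion of 0.5.
t = np.linspace(0.0, 4.0 * np.pi, 400)
helix = np.stack([np.cos(t), np.sin(t), t], axis=1)
curvature, torsion = discrete_curvature_torsion(helix)
```

Both quantities are invariant to rigid translations and rotations of the point set, which is the sense in which such a representation is intrinsic to the shape.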

On the Importance of Clinical Notes in Multi-modal Learning for EHR Data

Severin Husmann , Hugo Yèche , Gunnar Rätsch , Rita Kuznetsova

Category: Machine Learning

2022-12-06

Understanding deep learning model behavior is critical to accepting machine learning-based decision support systems in the medical community. Previous research has shown that jointly using clinical notes with electronic health record (EHR) data improved predictive performance for patient monitoring in the intensive care unit (ICU). In this work, we explore the underlying reasons for these improvements. While relying on a basic attention-based model to allow for interpretability, we first confirm that performance significantly improves over state-of-the-art EHR data models when combining EHR data and clinical notes. We then provide an analysis showing improvements arise almost exclusively from a subset of notes containing broader context on patient state rather than clinician notes. We believe such findings highlight deep learning models for EHR data to be more limited by partially-descriptive data than by modeling choice, motivating a more data-centric approach in the field.

Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation

Sérgio Jesus , José Pombal , Duarte Alves , André Cruz , Pedro Saleiro , Rita P. Ribeiro , João Gama , Pedro Bizarro

Category: Machine Learning

2022-11-24

Evaluating new techniques on realistic datasets plays a crucial role in the development of ML research and its broader adoption by practitioners. In recent years, there has been a significant increase of publicly available unstructured data resources for computer vision and NLP tasks. However, tabular data -- which is prevalent in many high-stakes domains -- has been lagging behind. To bridge this gap, we present Bank Account Fraud (BAF), the first publicly available privacy-preserving, large-scale, realistic suite of tabular datasets. The suite was generated by applying state-of-the-art tabular data generation techniques on an anonymized, real-world bank account opening fraud detection dataset. This setting carries a set of challenges that are commonplace in real-world applications, including temporal dynamics and significant class imbalance. Additionally, to allow practitioners to stress test both performance and fairness of ML methods, each dataset variant of BAF contains specific types of data bias. With this resource, we aim to provide the research community with a more realistic, complete, and robust test bed to evaluate novel and existing methods.

RITA: Boost Autonomous Driving Simulators with Realistic Interactive Traffic Flow

Zhengbang Zhu , Shenyu Zhang , Yuzheng Zhuang , Yuecheng Liu , Minghuan Liu , Liyuan Mao , Ziqing Gong , Weinan Zhang , Shixiong Kai , Qiang Gu

Category: Artificial Intelligence | Robotics

2022-11-07

High-quality traffic flow generation is the core module in building simulators for autonomous driving. However, the majority of available simulators are incapable of replicating traffic patterns that accurately reflect the various features of real-world data while also simulating human-like reactive responses to the tested autopilot driving strategies. Taking one step forward to addressing such a problem, we propose Realistic Interactive TrAffic flow (RITA) as an integrated component of existing driving simulators to provide high-quality traffic flow for the evaluation and optimization of the tested driving strategies. RITA is developed with fidelity, diversity, and controllability in consideration, and consists of two core modules called RITABackend and RITAKit. RITABackend is built to support vehicle-wise control and provide traffic generation models from real-world datasets, while RITAKit is developed with easy-to-use interfaces for controllable traffic generation via RITABackend. We demonstrate RITA's capacity to create diversified and high-fidelity traffic simulations in several highly interactive highway scenarios. The experimental findings demonstrate that our produced RITA traffic flows meet all three design goals, hence enhancing the completeness of driving strategy evaluation. Moreover, we showcase the possibility for further improvement of baseline strategies through online fine-tuning with RITA traffic flows.

Adaptation of Autoencoder for Sparsity Reduction From Clinical Notes Representation Learning

Thanh-Dung Le , Rita Noumeir , Jerome Rambaud , Guillaume Sans , Philippe Jouvet

Category: Machine Learning | Natural Language Processing

2022-09-26

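When dealing with clinical text classification on a small dataset, recent studies have confirmed that a well-tuned multilayer perceptron outperforms other generative classifiers, including deep learning ones. To increase the performance of the neural network classifier, feature selection for the learning representation can be used effectively. However, most feature selection methods only estimate the degree of linear dependency between variables and select the best features based on univariate statistical tests. Furthermore, the sparsity of the feature space involved in the learning representation is ignored. Goal: Our aim is therefore to assess an alternative approach that tackles sparsity by compressing the clinical representation feature space, so that limited French clinical notes can also be handled effectively. Methods: This study proposes an autoencoder learning algorithm that takes advantage of sparsity reduction in clinical note representation. The motivation is to determine how to compress sparse, high-dimensional data by reducing the dimension of the clinical note representation feature space. The classification performance of the classifiers is then evaluated in the trained and compressed feature space. Results: The proposed approach provides overall performance gains of up to 3% for each evaluation. Finally, the classifier achieves 92% accuracy, 91% recall, 91% precision, and a 91% F1-score in detecting the patient's condition. Furthermore, the compression working mechanism and the autoencoder prediction process are demonstrated by applying the theoretic information bottleneck framework.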

Negation, Coordination, and Quantifiers in Contextualized Language Models

Aikaterini-Lida Kalouli , Rita Sevastjanova , Christin Beck , Maribel Romero

Category: Natural Language Processing | Artificial Intelligence

2022-09-16

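With the success of contextualized language models, much research explores what these models really learn and in which cases they still fail. Most of this work focuses on specific NLP tasks and on the learning outcome. Little research has attempted to decouple the models' weaknesses from specific tasks and to focus on the embeddings themselves and their mode of learning. In this paper, we take up this research opportunity: based on theoretical linguistic insights, we explore whether the semantic constraints of function words are learned and how the surrounding context impacts their embeddings. We create suitable datasets, provide new insights into the inner workings of LMs vis-à-vis function words, and implement an assisting visual web interface for qualitative analysis.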

What Did I Just Hear? Detecting Pornographic Sounds in Adult Videos Using Neural Networks

Holy Lovenia , Dessi Puji Lestari , Rita Frieske

Category: Artificial Intelligence

2022-09-08

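Audio-based pornographic detection enables efficient adult content filtering by exploiting distinct spectral characteristics. To improve it, we explore pornographic sound modeling based on different neural architectures and acoustic features. We find that a CNN trained on log mel spectrograms achieves the best performance on the Pornography-800 dataset. Our experimental results also show that the log mel spectrogram gives the models a better representation for recognizing pornographic sounds. Finally, to classify whole audio waveforms rather than segments, we employ a voting segment-to-audio technique that yields the best audio-level detection results.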
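
The segment-to-audio voting step can be sketched as follows. The abstract does not specify the voting rule, so plain majority voting over per-segment predictions is an assumption here, and the function name `vote_segment_to_audio` and the label strings are hypothetical.

```python
from collections import Counter

def vote_segment_to_audio(segment_labels):
    """Return the most frequent segment-level label as the audio-level label.

    Assumption: simple majority voting. On a tie, the label that appears
    first in `segment_labels` wins (Counter keeps first-seen order).
    """
    if not segment_labels:
        raise ValueError("need at least one segment prediction")
    counts = Counter(segment_labels)
    return counts.most_common(1)[0][0]

# Usage: per-segment classifier outputs for one audio file.
audio_label = vote_segment_to_audio(["positive", "negative", "positive"])
```

Aggregating this way lets a classifier trained on short fixed-length segments produce one decision per recording.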

Temporal Label Smoothing for Early Prediction of Adverse Events

Hugo Yèche , Alizée Pace , Gunnar Rätsch , Rita Kuznetsova

Category: Machine Learning

2022-08-29

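Models that can predict adverse events ahead of time with low false-alarm rates are critical to the acceptance of decision support systems in the medical community. This challenging machine learning task is still typically treated as simple binary classification, with few bespoke methods proposed to leverage the temporal dependency across samples. We propose Temporal Label Smoothing (TLS), a novel learning strategy that modulates smoothing strength as a function of proximity to the event of interest. This regularization technique reduces model confidence at the class boundary, where the signal is often noisy or uninformative, thus allowing training to focus on clinically informative data points away from this boundary region. From a theoretical perspective, we also show that our method can be framed as an extension of multi-horizon prediction, a learning heuristic proposed in other early prediction work. TLS empirically matches or outperforms the considered competing methods on various early prediction benchmark tasks. In particular, our approach significantly improves performance on clinically relevant metrics such as event recall at low false-alarm rates.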
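
The idea of modulating label smoothing by proximity to the event can be illustrated with a minimal sketch. The abstract does not give the smoothing function, so the sigmoid form below, the function names, and the `temperature` parameter are all assumptions made for illustration, not the paper's definition.

```python
import numpy as np

def temporal_soft_labels(time_to_event, horizon, temperature=1.0):
    """Soft positive-class targets as a function of time to the event.

    Assumed form: a sigmoid centred on the prediction horizon. Samples well
    inside the horizon get targets near 1, samples far outside get targets
    near 0, and samples at the class boundary get 0.5 (lowest confidence).
    Use np.inf for samples that are never followed by an event.
    """
    d = np.asarray(time_to_event, dtype=float)
    return 1.0 / (1.0 + np.exp((d - horizon) / temperature))

def soft_binary_cross_entropy(probs, soft_labels, eps=1e-7):
    """Mean binary cross-entropy between predicted probabilities and soft targets."""
    p = np.clip(np.asarray(probs, dtype=float), eps, 1.0 - eps)
    q = np.asarray(soft_labels, dtype=float)
    return float(np.mean(-(q * np.log(p) + (1.0 - q) * np.log(1.0 - p))))

# Usage: hours until the event for four samples, with an 8-hour horizon.
targets = temporal_soft_labels([0.0, 8.0, 100.0, np.inf], horizon=8.0, temperature=2.0)
loss = soft_binary_cross_entropy([0.9, 0.6, 0.1, 0.05], targets)
```

With hard labels the target would jump from 1 to 0 at the horizon. Here the transition is gradual, so the loss penalizes confident predictions near the boundary less than confident errors far from it.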

Visual Comparison of Language Model Adaptation

Rita Sevastjanova , Eren Cakmak , Shauli Ravfogel , Ryan Cotterell , Mennatallah El-Assady

Category: Artificial Intelligence

2022-08-17

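Neural language models are widely used; however, their model parameters often need to be adapted to the specific domains and tasks of an application, which is time- and resource-consuming. Thus, adapters have recently been introduced as a lightweight alternative for model adaptation. They consist of a small set of task-specific parameters with reduced training time and simple parameter composition. The simplicity of adapter training and composition brings new challenges, such as maintaining an overview of adapter properties and effectively comparing the embedding spaces they produce. To help developers overcome these challenges, we provide a twofold contribution. First, in close collaboration with NLP researchers, we conducted a requirement analysis for an approach supporting adapter evaluation and detected the need for both intrinsic (i.e., embedding similarity-based) and extrinsic (i.e., prediction-based) explanation methods. Second, motivated by the gathered requirements, we designed a flexible visual analytics workspace that enables the comparison of adapter properties. In this paper, we discuss several design iterations and alternatives for interactive, comparative visual explanation methods. Our comparative visualizations show the differences in the adapted embedding vectors and prediction outcomes for diverse human-interpretable concepts (e.g., person names, human qualities). We evaluate our workspace through case studies and show that, for instance, an adapter trained on the language debiasing task according to context-0 (decontextualized) embeddings introduces a new type of bias in which words (even gender-independent ones) become more similar to female than to male pronouns. We demonstrate that these are artifacts of context-0 embeddings.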

Boosting Modern and Historical Handwritten Text Recognition with Deformable Convolutions

Silvia Cascianelli , Marcella Cornia , Lorenzo Baraldi , Rita Cucchiara

Category: Computer Vision

2022-08-17

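Handwritten Text Recognition (HTR) in free-layout pages is a challenging image understanding task that can provide a relevant boost to the digitization of handwritten documents and the reuse of their content. The task becomes even more challenging when dealing with historical documents, due to the variability of the writing style and the degradation of the page quality. State-of-the-art HTR approaches typically couple recurrent structures for sequence modeling with Convolutional Neural Networks for visual feature extraction. Since convolutional kernels are defined on fixed grids and treat all input pixels independently as they move over the input image, this strategy disregards the fact that handwritten characters can vary in shape, scale, and orientation even within the same document, and that ink pixels are more relevant than background ones. To cope with these specific HTR difficulties, we propose to adopt deformable convolutions, which can deform depending on the input at hand and better adapt to the geometric variations of the text. We design two deformable architectures and conduct extensive experiments on both modern and historical datasets. Experimental results confirm the suitability of deformable convolutions for the HTR task.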
